10 research outputs found
Accurate Profiling of Microbial Communities from Massively Parallel Sequencing using Convex Optimization
We describe the Microbial Community Reconstruction ({\bf MCR}) Problem, which
is fundamental for microbiome analysis. In this problem, the goal is to
reconstruct the identity and frequency of species comprising a microbial
community, using short sequence reads from Massively Parallel Sequencing (MPS)
data obtained for specified genomic regions. We formulate the problem
mathematically as a convex optimization problem and provide sufficient
conditions for identifiability, namely the ability to reconstruct species
identity and frequency correctly when the data size (number of reads) grows to
infinity. We discuss different metrics for assessing the quality of the
reconstructed solution, including a novel phylogenetically-aware metric based
on the Mahalanobis distance, and give upper-bounds on the reconstruction error
for a finite number of reads under different metrics. We propose a scalable
divide-and-conquer algorithm for the problem using convex optimization, which
enables us to handle large problems (with species). We show using
numerical simulations that for realistic scenarios, where the microbial
communities are sparse, our algorithm gives solutions with high accuracy, both
in terms of obtaining accurate frequency, and in terms of species phylogenetic
resolution.Comment: To appear in SPIRE 1
Recommended from our members
Dietary fat promotes antibiotic-induced Clostridioides difficile mortality in mice
Clostridioides difficile infection (CDI) is the leading cause of hospital-acquired diarrhea, and emerging evidence has linked dietary components with CDI pathogenesis, suggesting that dietary modulation may be an effective strategy for prevention. Here, we show that mice fed a high-fat/low-fiber “Western-type” diet (WD) had dramatically increased mortality in a murine model of antibiotic-induced CDI compared to a low-fat/low-fiber (LF/LF) diet and standard mouse chow controls. We found that the WD had a pro- C. difficile bile acid composition that was driven in part by higher levels of primary bile acids that are produced to digest fat, and a lower level of secondary bile acids that are produced by the gut microbiome. This lack of secondary bile acids was associated with a greater disturbance to the gut microbiome with antibiotics in both the WD and LF/LF diet compared to mouse chow. Mice fed the WD also had the highest level of toxin TcdA just prior to the onset of mortality, but not of TcdB or increased inflammation. These findings indicate that dietary intervention to decrease fat may complement previously proposed dietary intervention strategies to prevent CDI in high-risk individuals. © 2022, The Author(s).Open access journalThis item from the UA Faculty Publications collection is made available by the University of Arizona with support from the University of Arizona Libraries. If you have questions, please contact us at [email protected]
RNASTAR: An RNA STructural Alignment Repository that provides insight into the evolution of natural and artificial RNAs
Automated RNA alignment algorithms often fail to recapture the essential conserved sites that are critical for function. To assist in the refinement of these algorithms, we manually curated a set of 148 alignments with a total of 9600 unique sequences, in which each alignment was backed by at least one crystal or NMR structure. These alignments included both naturally and artificially selected molecules. We used principles of isostericity to improve the alignments from an average of 83%-94% isosteric base pairs. We expect that this alignment collection will assist in a wide range of benchmarking efforts and provide new insight into evolutionary principles governing change in RNA structural motifs. The improved alignments have been contributed to the Rfam database